
Replication Data for: Sustainable and Inclusive - Evaluating urban sustainability indicators: suitability for measuring progress towards SDG-11
These files allow replication of the analysis for Thomas, Ryan, Hsu, Angel, and Weinfurter, Amy. Under review. __Sustainable and Inclusive - Evaluating urban sustainability indicators' suitability for measuring progress towards SDG-11__. _Environment and Planning B: City Science and Analytics_.

Data were collected and stored in a Google Sheet prior to publication. All analysis was conducted in the R programming language. Replication code for figures representing the bulk of the analysis in the code file, described below. Please see the full publication (citation above) for a description of the methods.

# Code file
+ indicator_paper_graph_plot_share - This file reads in the data files and recreates the plots used in the publication. It could be edited to do additional analysis and locate metadata of the indexes and indicators reviewed in the publication.

+ To load the custom theme used, install UESIplots package in R with the following command:
devtools::install_github("datadrivenyale/UESIplots")

# Data file descriptions
There are five data files included: indicators.csv, index_countries.csv, edges.csv, nodes.csv, and sdg_labels.csv. Below is a metadata description of each file and its columns. Files are ordered in terms of relevance to the findings of the paper.

## 1 indicators.csv
Main data set for analysis of indicators. This data set collates indicator metadata from the selected indexes for quantitative analysis of counts and distributions.
+ Index* - Name of the index that calculated the indicator
+ Sector - Sector of the organization that created the index
+ Indicator* - Name of the indicator
+ Issue Area - Qualitative coding of most relevant environmental issue
+ SDG_11 - Qualitative coding of most relevant Sustainable Developments Goal 11 target or indicator
+ SDG_11 (Target or Indicator) - True/False of whether the SDG_11 field refers to a target (formatted like 11.x) or indicator (formatted like 11.x.1, 11.x.2, etc.)
+ Units* - Indicator units
+ Unit type - (Factor) one of three values: "Multi-modal", "Single - Absolute", "Single - Percentage", or "No Units". These refer to the units in which the indicator was reported. Multi-modal refers to indicators that had multiple raw data sources, and thus multiple modalities of units. Absolute and percentage refer to scientific units (e.g. parts per million for air pollutants or meters to a transit stop) and percentages of scientific units, respectively. NA means not available.
+ Unit Scale - (Factor) scale of the indicator, such as city-level, neighborhood, etc.
+ Component*, *** - Most detailed level of hierarchical organization for index; these are often used to relate indicators to index themes.
+ Theme*, *** - Second most detailed level of hierarchical organization for index; these are often used to relate indicators to index themes.
+ Indicator description* - Additional information on the indicator provided by the index documentation.
+ Equity - Whether or not this indicator could be used to assess equity (regardless of whether or not it was actually used in this way)
+ Data Source Notes** - When provided, relevant details about the data source used to calculate the indicator.
+ Target Quality - The typology of the targets and baselines, as described in the paper. These are one of: "No Target", "Baseline Only", "Target but no Baseline", "Directional (increase/decrease)", or "Target with Baseline".
+ Methodology Notes** - When provided, relevant details about the methodology used to transform data described in `Data Source Notes` field into the indicator.
+ Open Data - When available, True/False of whether the data source is openly available.
+ index_url - Link to index or indicator metadata. May be used to locate metadata of the indexes and indicators reviewed in the publication.

## 2 index_countries.csv
Simplified edge list for index network without spatial coordinates; supply to `edges=` argument in `igraph::graph_from_data_frame()` R function
+ origin - Country location of the index organization
+ scope - Country location of regions and cities included in the index listed in `index` field
+ index - Name of the index
+ sector - Sector of the index organization
+ scope_group - Country classification of the scope country
+ origin_group - Country classification of the origin country

## 3 edges.csv
Full edge list for index network with spatial coordinates.
+ from - ID of origin country
+ to - ID of to country
+ index - Name of the index
+ sector - Sector of the index organization
+ scope_group - UN 2014 Country classification of the scope country
+ origin_group - UN 2014 Country classification of the origin country
+ from_name - Origin country
+ from_group - Duplicated origin_group
+ from_geometry - Well-known text (WKT) point of origin country
+ from_lon - Longitude of origin_name country centroid
+ from_lat - Latitude of origin_name country centroid
+ from_id - Unique ID for origin Country
+ from_weight - Number of connections between origin and scope countries
+ to_group - Duplicated scope_group
+ to_geometry - Well-known text (WKT) point of scope country
+ to_lon - Longitude of `scope_name` country centroid
+ to_lat - Latitude of `origin_name` country centroid
+ to_id - nique ID for scope Country
+ to_plot_name - Scope country name cleaned for plotting


## 4 nodes.csv
Detail of countries included in network; supply to `nodes=` argument in `igraph::graph_from_data_frame()` R function
+ name - Country name
+ group -  UN 2014 Country Classification of country in `name` field
+ geometry - Well-known text (WKT) point
+ lon - Longitude of country centroid
+ lat - Latitude of country centroid
+ id - ID of the country
+ weight - Number of connections between country and all others
+ plot_name - Selected names used for labeling most frequent countries

## 5 tableS1.csv
Replicated table from publication describing each reviewed index and the number of indicators included in the analysis
+ Index - Name of the index
+ Indicators - number of indicators reviewed
+ Description - Text description of the index, usually copied from the website or document of the index.

### Notes on metadata:
* Verbatim from index documentation. Used in qualitative coding.
** Combination of verbatim from the index documentation with some clarifications and additions from data collection team. These were not used in quantitative analysis. Used in qualitative coding.
*** Most (all?) indexes use a hierarchical organization pattern to group indicators by subject area into themes and components of those themes. Used in qualitative coding. These informed our coding of the `Issue Area` field.

# Recommended Citation
Thomas, R., Hsu, A., Weinfurter, A. (2020). "Replication Data for: Sustainable and Inclusive - Evaluating urban sustainability indicators' suitability for measuring progress towards SDG-11". https://doi.org/10.7910/DVN/30FLEB, Harvard Dataverse.
